SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Conneely, Orla M.

Headon, Denis R.
O'Malley, Bert W.
May, Gregory S.

(ii) TITLE OF INVENTION: Production of Recombinant Lactoferrin 
and Lactoferrin Polypeptides Using cDNA Sequences in 
Various Organisms 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10036 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: Concurrently herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/145,681 

(B) FILING DATE: October 28, 1993 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Albert P. Halluin

(B) REGISTRATION NUMBER: 25,227 

(C) REFERENCE/DOCKET NUMBER: 8206-023 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-854-3660 

(B) TELEFAX: 415-854-3694 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: H. sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



oi TV TV »T»rno^/^ TV r* 


r^r* o TV o 7\ 7\ 'pr* 




X V.«V^ X X X 


XwX XV^wJ.\». 


\j\J\3\3 V» -i- VJ 


60 








O X V7 X X \^i^\3 X \3 


\J X w w^^^V-'VJ X rX. 


TCCCAACCCG 


120 


TV /^r^ /^/^TV O 7\ TV TV 




Trr*/^ TV TV TV TV 


TVTivTnTvnTvan 

A X A X AVj AAVJ 


TV rsTr; pnTGG p 

AW X VJV^O X WWV« 


PPTCCITGTCA 


180


X O^JiAX AAA 


VSAwAVVn^ X 


r'pr'ATr'PAGT 

V^\v^«xvX V^Va^nw X 


V3 X^^X v*wA&i\3wVir 


CATTGCGGAA


AACAGGGCCG 


240 




CCTTGATGGT 

Xi A ^X^X WW X 


GGTTTCATAT 


ACGAGGCAGG 


CCTGGCCCCC 


TACAAACTGC 


300 


GACCTGTAGC GGCGGAAGTC TACGGGACCG AAAGACAGCC ACGAACTCAC TATTATGCCG 360

TGGCTGTGGT GAAGAAGGGC GGCAGCTTTC AGCTGAACGA ACTGCAAGGT CTGAAGTCCT 420

GCCACACAGG CCTTCGCAGG ACCGCTGGAT GGAATGTGCC TATAGGGACA CTTCGTCCAT 480

TCTTGAATTG GACGGGTCCA CCTGAGCCCA TTGAGGCAGC TGTGGCCAGG TTCTTCTCAG 540

CCAGCTGTGT TCCCGGTGCA GATAAAGGAC AGTTCCCCAA CCTGTGTCGC CTGTGTGCGG 600

GGACAGGGGA AAACAAATGT GCCTTCTCCT CCCAGGAACC GTACTTCAGC TACTCTGGTG 660

CCTTCAAGTG TCTGAGAGAC GGGGCTGGAG ACGTGGCTTT TATCAGAGAG AGCACAGTGT 720

TTGAGGACCT GTCAGACGAG GCTGAAAGGG ACGAGTATGA GTTACTCTGC CCAGACAACA 780

CTCGGAAGCC AGTGGACAAG TTCAAAGACT GCCATCTGGC CCGGGTCCCT TCTCATGCCG 840

TTGTGGCACG AAGTGTGAAT GGCAAGGAGG ATGCCATCTG GAATCTTCTC CGCCAGGCAC 900

AGGAAAAGTT TGGAAAGGAC AAGTCACCGA AATTCCAGCT CTTTGGCTCC CCTAGTGGGC 960

AGAAAGATCT GCTGTTCAAG GACTCTGCCA TTGGGTTTTC GAGGGTGCCC CCGAGGATAG 1020

ATTCTGGGCT GTACCTTGGC TCCGGCTACT TCACTGCCAT CCAGAACTTG AGGAAAAGTG 1080

AGGAGGAAGT GGCTGCCCGG CGTGCGCGGG TCGTGTGGTG TGCGGTGGGC GAGCAGGAGC 1140

TGCGCAAGTG TAACCAGTGG AGTGGCTTGA GCGAAGGCAG CGTGACCTGC TCCTCGGCCT 1200

CCACCACAGA GGACTGCATC GCCCTGGTGC TGAAAGGAGA AGCTGATGCC ATGAGTTTGG 1260

ATGGAGGATA TGTGTACACT GCAGGCAAAT GTGGTTTGGT GCCTGTCCTG GCAGAGAACT 1320

ACAAA i C A




GACCCTGATC 


CTAACTGTGT 


GGATAGACCT 


GTGGAAGGAT 


1380 


TV wj%f%tttt IV? O'T^ST 


WW X WW X X 


AGGAGATCAG 


ACACTAGCCT 


TACCTGGAAC 


TCTGTGAAAG 


1440 






GCCGTGGACA 


GGACTGCAGG 


CTGGAATATC 


CCCATGGGCC 


1500 


TGCTC 1 1 


<M.I«AWAV«.V3WW\^ 


TCCTGCAAAT


TTGATGAATA TTTCAGTCAA AGCTGTGCCC 


1560 




^L..U.O>iurA X w X 


ZV ATPTPTGTG 

nJtxx\^ X V» X w X w 


CTCTGTGTAT 


TGGCGACGAG 


CAGGGTGAGA 


1620 


ATAA(3TV9L.V3 1 




AATGAGAGAT 


ACTACGGCTA 


CACTGGGGCT 


TTCCGGTGCC 


1680 


nvi/*t^rn/*i 7V ^ TV TV 

TGGCTGACsAA 




V9 X X w^«^% XXX w 


TGAAAGATGT 


CACTGTCTTG 


CAGAACACTG 


1740 


ATGvsAAAXAA 


r* TV ft TVS a. nGP n 


TGGGCTAAGG


ATTTGAAGCT 


GGCAGACTTT 


GCGCTGCTGT 


1800 


GCCTCGATC?V3 


V^AAAwwV3/vV.V3 


CCTGTGACTG


AGGCTAGAAG 


CTGCCATCTT 


GCCATGGCCC 


1860 


CGAA J. CA. 1 GC^ 


rr* T r* n T/TP p T 

1 X o 1 ^ X 


PnGATGGATA 


AGGTGGAACG 


CCTGT^CAG 


GTGCTGCTCC 


1920 


ACC AAUAGG i- 




A G AAATGGAT 


CTGACTGCCC 


GGACAAGTTT 


TGCTTATTCC 


1980 


7v /^*r«r"pr' a 7\ 7^ 
AG i i- 1 


TV TV ZV 2V 2V PPTT 


CTGTTCAATG

X -X. X \— *X«-xjk X VJ 


ACAACACTGA 


GTGTCTGGCC 


AGACTCCATG 


2040 


GCAAAAUAAU 




TATTTGGGAC 


CACAGTATGT 


CGCAGGCATT 


ACTAATCTGA^^^ 


2100 






                      CTCCTGGAAG CCTGTGAATT CCTCAGGAAG TAAAACCGAA 2160

GAAGATGGCC CAGCTCCCCA AGAAAGCCTC AGCCATTCAC TGCCCCCAGC TCTTCTCCCC 2220

AGGTGTGTTG GGGCCTTGGC TCCCCTGCTG AAGGTGGGGA TTGCCCATCC ATCTGCTTAC 2280

AATTCCCTGC TGTCGTCTTA GCAAGAAGTA AAATGAGAAA TTTTGTTGAA AAAAAAAAAA 2340

AAAAAAAAAA AAAAAAAAAA 2360

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 711 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: H. sapiens

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Leu Val Phe Leu Val Leu Leu Phe Leu Gly Ala Leu Gly Leu
1 5 10 15

Cys Leu Ala Gly Arg Arg Arg Arg Ser Val Gln Trp Cys Thr Val Ser
20 25 30

Gln Pro Glu Ala Thr Lys Cys Phe Gln Trp Gln Arg Asn Met Arg fSt^
35 40 45

Val Arg Gly Pro Pro Val Ser Cys Ile Lys Arg Asp Ser Pro Ile Gln
50 55 60

Cys Ile Gln Ala Ile Ala Glu Asn Arg Ala Asp Ala Val Thr Leu Asp
65 70 75 80

Gly Gly Phe Ile Tyr Glu Ala Gly Leu Ala Pro Tyr Lys Leu Arg Pro
85 90 95

Val Ala Ala Glu Val Tyr Gly Thr Glu Arg Gln Pro Arg Thr His Tyr
100 105 110

Tyr Ala Val Ala Val Val Lys Lys Gly Gly Ser Phe Gln Leu Asn Glu
115 120 125

Leu Gln Gly Leu Lys Ser Cys His Thr Gly Leu Arg Arg Thr Ala Gly
130 135 140

Trp Asn Val Pro Ile Gly Thr Leu Arg Pro Phe Leu Asn Trp Thr Gly
145 150 155 160

Pro Pro Glu Pro Ile Glu Ala Ala Val Ala Arg Phe Phe Ser Ala Ser
165 170 175

Cys Val Pro Gly Ala Asp Lys Gly Gln Phe Pro Asn Leu Cys Arg Leu
180 185 190

Cys Ala Gly Thr Gly Glu Asn Lys Cys Ala Phe Ser Ser Gln Glu Pro
195 200 205

Tyr Phe Ser Tyr Ser Gly Ala Phe Lys Cys Leu Arg Asp Gly Ala Gly
210 215 220

Asp Val Ala Phe Ile Arg Glu Ser Thr Val Phe Glu Asp Leu Ser Asp
225 230 235 240

Glu Ala Glu Arg Asp Glu Tyr Glu Leu Leu Cys Pro Asp Asn Thr Arg
245 250 255

Lys Pro Val Asp Lys Phe Lys Asp Cys His Leu Ala Arg Val Pro Ser
260 265 270

His Ala Val Val Ala Arg Ser Val Asn Gly Lys Glu Asp Ala Ile Trp
275 280 285

Asn Leu Leu Arg Gln Ala Gln Glu Lys Phe Gly Lys Asp Lys Ser Pro
290 295 300

Lys Phe Gln Leu Phe Gly Ser Pro Ser Gly Gln Lys Asp Leu Leu Phe
305 310 315 320

Lys Asp Ser Ala Ile Gly Phe Ser Arg Val Pro Pro Arg Ile Asp Ser
325 330 335

Gly Leu Tyr Leu Gly Ser Gly Tyr Phe Thr Ala Ile Gln Asn Leu Arg
340 345 350

Lys Ser Glu Glu Glu Val Ala Ala Arg Arg Ala Arg Val Val Trp Cys
355 360 365

Ala Val Gly Glu Gln Glu Leu Arg Lys Cys Asn Gln Trp Ser Gly Leu
370 375 380

Ser Glu Gly Ser Val Thr Cys Ser Ser Ala Ser Thr Thr Glu Asp Cys
385 390 395 400

Ile Ala Leu Val Leu Lys Gly Glu Ala Asp Ala Met Ser Leu Asp Gly
405 410 415

Gly Tyr Val Tyr Thr Ala Gly Lys Cys Gly Leu Val Pro Val Leu Ala
420 425 430

Glu Asn Tyr Lys Ser Gln Gln Ser Ser Asp Pro Asp Pro Asn Cys Val
435 440 445

Asp Arg Pro Val Glu Gly Tyr Leu Ala Val Ala Val Val Arg Arg Ser
450 455 460

Asp Thr Ser Leu Thr Trp Asn Ser Val Lys Gly Lys Lys Ser Cys His
465 470 475 480

Thr Ala Val Asp Arg Thr Ala Gly Trp Asn Ile Pro Met Gly Leu Leu
485 490 495

Phe Asn Gln Thr Gly Ser Cys Lys Phe Asp Glu Tyr Phe Ser Gln Ser
500 505 510

Cys Ala Pro Gly Ser Asp Pro Arg Ser Asn Leu Cys Ala Leu Cys Ile
515 520 525

Gly Asp Glu Gln Gly Glu Asn Lys Cys Val Pro Asn Ser Asn Glu Arg
530 535 540

Tyr Tyr Gly Tyr Thr Gly Ala Phe Arg Cys Leu Ala Glu Asn Ala Gly
545 550 555 560

Asp Val Ala Phe Val Lys Asp Val Thr Val Leu Gln Asn Thr Asp Gly
565 570 575

Asn Asn Asn Glu Ala Trp Ala Lys Asp Leu Lys Leu Ala Asp Phe Ala
580 585 590

Leu Leu Cys Leu Asp Gly Lys Arg Lys Pro Val Thr Glu Ala Arg Ser
595 600 605

Cys His Leu Ala Met Ala Pro Asn His Ala Val Val Ser Arg Met Asp
610 615 620

Lys Val Glu Arg Leu Lys Gln Val Leu Leu His Gln Gln Ala Lys Phe
625 630 635 640

Gly Arg Asn Gly Ser Asp Cys Pro Asp Lys Phe Cys Leu Phe Gln Ser
645 650 655

Glu Thr Lys Asn Leu Leu Phe Asn Asp Asn Thr Glu Cys Leu Ala Arg
660 665 670

Leu His Gly Lys Thr Thr Tyr Glu Lys Tyr Leu Gly Pro Gln Tyr Val
675 680 685

Ala Gly Ile Thr Asn Leu Lys Lys Cys Ser Thr Ser Pro Leu Leu Glu
690 695 700

Ala Cys Glu Phe Leu Arg Lys
705 710

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bos taurus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GAGCCTTCGT TCCGGAGTCG CCCCAGGACG CCAGCCCATG AAGCTCTTCG TCCCCGCCCT 60

CCTGTCCCTT GGAGCCCTTG GACTGTGTCT GGCTGCCCCG AGGAAAAACG TTCGATGGTG 120

TACCATCTCC CAACCTGAGT GGTTCAAATG CCGCAGATGG CAGTGGAGGA TGAAGAAGCT 180

GGGTGCTCCC TCTATCACCT GTGTGAGGCG GGCCTTTGCC TTGGAATGTA TTCCGGGCAT 240

CGCGGAGAAA AAGGCGGATG CTGTGACCCT GGATGGTGGC ATGGTGTTTG AGGCGGGCCG 300

GGACCCCTAC AAACTGCGGC CAGTAGCAGC AGAGATCTAT GGGACGAAAG AGTCTCCCCA 360

AACCCACTAT TATGCTGTGG CCGTCGTGAA GAAGGGCAGC AACTTTCAGC TGGACCAGCT 420

GCAAGGCCGG AAGTCCTGCC ATACGGGCCT TGGCAGGTCC GCTGGGTGGA TCATCCCTAT 480

                      TGAGCTGGAC AGAGTCACTC GAGCCCCTCC AGGGAGCTGT 540

GGCTAAATTC TTCTCTGCCA GCTGTGTTCC CTGCATTGAT AGACAAGCAT ACCCCAACCT 600

GTGTCAACTG TGCAAGGGGG AGGGGGAGAA CCAGTGTGCC TGCTCCTCCC GGGAACCATA 660

CTTCGGTTAT TCTGGTGCCT TCAAGTGTCT GCAGGACGGG GCTGGAGACG TGGCTTTTGT 720

TAAAGAGACG ACAGTGTTTG AGAACTTGCC AGAGAAGGCT GACAGGGACC AGTATGAGCT 780

TCTCTGCCTG AACAACAGTC GGGCGCCAGT GGATGCGTTC AAGGAGTGCC ACCTGGCCCA 840

GGTCCCTTCT CATGCTGTCG TGGCCCGAAG TGTGGATGGC AAGGAAGACT TGATCTGGAA 900

GCTTCTCAGC AAGGCGCAGG AGAAATCTGG AAAAAACAAG TCTCGGAGCT TCCAGCTCTT 960

TGGCTCTCCA CCCGGCCAGA GGGACCTGCT GTTCAAAGAC TCTGCTCTTG GGTTTTTGAG 1020

GATCCCCTCG AAGGTAGATT CGGCGCTGTA CCTGGGCTCC CGCTACTTGA CCACCTTGAA 1080

GAACCTCAGG GAAACTGCGG AGGAGGTGAA GGCGCGGTAC ACCAGGGTCG TGTGGTGTGC 1140

CGTGGGACCT GAGGAGCAGA AGAAGTGCCA GCAGTGGAGC CAGCAGAGCG GCCAGAACGT 1200

GACCTGTGCC ACGGCGTCCA CCACTGACGA CTGCATCGTC CTGGTGCTGA AAGGGGAAGC 1260

AGATGCCCTG AACTTGGATG GAGGATATAT CTACACTGCG GGCAAGTGTG GCCTGGTGCC 1320

                      AATCCTCCAA ACACAGTAGC CTAGATTGTG TGCTGAGACC 1380

AACGGAAGGG TACCTTGCCG TGGCAGTTGT CAAGAAAGCA AATGAGGGGC TCACATGGAA 1440

TTCTCTGAAA GACAAGAAGT CGTGCCACAC CGCCGTGGAC AGGACTGCAG GCTGGAACAT 1500

CCCCATGGGC CTGATCGTCA ACCAGACAGG CTCCTGCGCA TTTGATGAAT TCTTTAGTCA 1560

GAGCTGTGCC CCTGGGGCTG ACCCGAAATC CAGACTCTGT GCCTTGTGTG CTGGCGATGA 1620

CCAGGGCCTG GACAAGTGTG TGCCCAACTC TAAGGAGAAG TACTATGGCT ATACCGGGGC 1680

TTTCAGGTGC CTGGCTGAGG ACGTTGGGGA CGTTGCCTTT GTGAAAAACG ACACAGTCTG 1740

GGAGAACACG AATGGAGAGA GCACTGCAGA CTGGGCTAAG AACTTGAATC GTGAGGACTT 1800

CAGGTTGCTC TGCCTCGATG GCACCAGGAA GCCTGTGACG GAGGCTCAGA GCTGCCACCT 1860

GGCGGTGGCC CCGAATCACG CTGTGGTGTC TCGGAGCGAT AGGGCAGCAC ACGTGAAACA 1920

GGTGCTGCTC CACCAGCAGG CTCTGTTTGG GAAAAATGGA AAAAACTGCC CGGACAAGTT 1980

TTGTTTGTTC AAATCTGAAA CCAAAAACCT TCTGTTCAAT GACAACACTG AGTGTCTGGC 2040

CAAACTTGGA GGCAGACCAA CGTATGAAGA ATATTTGGGG ACAGAGTATG TCACGGCCAT 2100

TGCCAACCTG AAAAAATGCT CAAGCTCCCC GCTTCTGGAA GCCTGCGCCT TCCTGACGAG 2160

GTAAAGCCTG CAAAGAAGCT AGCCTGCCTC CCTGGGCCTC AGCTCCTCCC TGCTCTCAGC 2220

CCCAATCTCC AGGCGCGAGG GACCTTCCTC TCCCTTCCTG AAGTCGGATT TTTGCCAAGC 2280

TCATCAGTAT TTACAATTCC CTGCTGTCAT TTTAGCAAGA AATAAAATTA GAAATGCTGT 2340

TGAAAAA 2347

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 708 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Bos taurus

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Lys Leu Phe Val Pro Ala Leu Leu Ser Leu Gly Ala Leu Gly Leu
1 5 10 15

Cys Leu Ala Ala Pro Arg Lys Asn Val Arg Trp Cys Thr Ile Ser Gln
20 25 30

Pro Glu Trp Phe Lys Cys Arg Arg Trp Gln Trp Arg Met Lys Lys Leu
35 40 45

Gly Ala Pro Ser Ile Thr Cys Val Arg Arg Ala Phe Ala Leu Glu Cys
50 55 60

Ile Pro Gly Ile Ala Glu Lys Lys Ala Asp Ala Val Thr Leu Asp Gly
65 70 75 80

Gly Met Val Phe Glu Ala Gly Arg Asp Pro Tyr Lys Leu Arg Pro Val
85 90 95

Ala Ala Glu Ile Tyr Gly Thr Lys Glu Ser Pro Gln Thr His Tyr Tyr
100 105 110

Ala Val Ala Val Val Lys Lys Gly Ser Asn Phe Gln Leu Asp Gln Leu
115 120 125

Gln Gly Arg Lys Ser Cys His Thr Gly Leu Gly Arg Ser Ala Gly Trp
130 135 140

Ile Ile Pro Met Gly Ile Leu Arg Pro Tyr Leu Ser Trp Thr Glu Ser
145 150 155 160

Leu Glu Pro Leu Gln Gly Ala Val Ala Lys Phe Phe Ser Ala Ser Cys
165 170 175

Val Pro Cys Ile Asp Arg Gln Ala Tyr Pro Asn Leu Cys Gln Leu Cys
180 185 190

Lys Gly Glu Gly Glu Asn Gln Cys Ala Cys Ser Ser Arg Glu Pro Tyr
195 200 205

Phe Gly Tyr Ser Gly Ala Phe Lys Cys Leu Gln Asp Gly Ala Gly Asp
210 215 220

Val Ala Phe Val Lys Glu Thr Thr Val Phe Glu Asn Leu Pro Glu Lys
225 230 235 240

Ala Asp Arg Asp Gln Tyr Glu Leu Leu Cys Leu Asn Asn Ser Arg Ala
245 250 255

Pro Val Asp Ala Phe Lys Glu Cys His Leu Ala Gln Val Pro Ser His
260 265 270

Ala Val Val Ala Arg Ser Val Asp Gly Lys Glu Asp Leu Ile Trp Lys
275 280 285

Leu Leu Ser Lys Ala Gln Glu Lys Ser Gly Lys Asn Lys Ser Arg Ser
290 295 300

Phe Gln Leu Phe Gly Ser Pro Pro Gly Gln Arg Asp Leu Leu Phe Lys
305 310 315 320

Asp Ser Ala Leu Gly Phe Leu Arg Ile Pro Ser Lys Val Asp Ser Ala
325 330 335

Leu Tyr Leu Gly Ser Arg Tyr Leu Thr Thr Leu Lys Asn Leu Arg Glu
340 345 350

Thr Ala Glu Glu Val Lys Ala Arg Tyr Thr Arg Val Val Trp Cys Ala
355 360 365

Val Gly Pro Glu Glu Gln Lys Lys Cys Gln Gln Trp Ser Gln Gln Ser
370 375 380

Gly Gln Asn Val Thr Cys Ala Thr Ala Ser Thr Thr Asp Asp Cys Ile
385 390 395 400

Val Leu Val Leu Lys Gly Glu Ala Asp Ala Leu Asn Leu Asp Gly Gly
405 410 415

Tyr Ile Tyr Thr Ala Gly Lys Cys Gly Leu Val Pro Val Leu Ala Glu
420 425 430

Asn Arg Lys Ser Ser Lys His Ser Ser Leu Asp Cys Val Leu Arg Pro
435 440 445

Thr Glu Gly Tyr Leu Ala Val Ala Val Val Lys Lys Ala Asn Glu Gly
450 455 460

Leu Thr Trp Asn Ser Leu Lys Asp Lys Lys Ser Cys His Thr Ala Val
465 470 475 480

Asp Arg Thr Ala Gly Trp Asn Ile Pro Met Gly Leu Ile Val Asn Gln
485 490 495

Thr Gly Ser Cys Ala Phe Asp Glu Phe Phe Ser Gln Ser Cys Ala Pro
500 505 510

Gly Ala Asp Pro Lys Ser Arg Leu Cys Ala Leu Cys Ala Gly Asp Asp
515 520 525

Gln Gly Leu Asp Lys Cys Val Pro Asn Ser Lys Glu Lys Tyr Tyr Gly
530 535 540

Tyr Thr Gly Ala Phe Arg Cys Leu Ala Glu Asp Val Gly Asp Val Ala
545 550 555 560

Phe Val Lys Asn Asp Thr Val Trp Glu Asn Thr Asn Gly Glu Ser Thr
565 570 575

Ala Asp Trp Ala Lys Asn Leu Asn Arg Glu Asp Phe Arg Leu Leu Cys
580 585 590

Leu Asp Gly Thr Arg Lys Pro Val Thr Glu Ala Gln Ser Cys His Leu
595 600 605

Ala Val Ala Pro Asn His Ala Val Val Ser Arg Ser Asp Arg Ala Ala
610 615 620

His Val Lys Gln Val Leu Leu His Gln Gln Ala Leu Phe Gly Lys Asn
625 630 635 640

Gly Lys Asn Cys Pro Asp Lys Phe Cys Leu Phe Lys Ser Glu Thr Lys
645 650 655

Asn Leu Leu Phe Asn Asp Asn Thr Glu Cys Leu Ala Lys Leu Gly Gly
660 665 670

Arg Pro Thr Tyr Glu Glu Tyr Leu Gly Thr Glu Tyr Val Thr Ala Ile
675 680 685

Ala Asn Leu Lys Lys Cys Ser Thr Ser Pro Leu Leu Glu Ala Cys Ala
690 695 700

Phe Leu Thr Arg
705

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2259 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sus scrofa 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



ACATGAAGCT CTTCATCCCC GCCCTGCTGT TL.C, 1 CCjCjvjAL. TGTCTGGCTG 60

CCCCTAAGAA AGGGGTTCGA TGGTGTGTCA TATCCACAGC AGAG 1 Al J. UA AAATGCCGCC 120

AGTGGCAATC AAAGATAAGA AGAACTAATC CCATGTTCTG CATAAGGAGG GCTTCTCCCA 180

CTGACTGTAT CCGGGCCATC GCGGCAAAAA GGGCAGATGC TGTGACCCTT GATGGTGGTT 240

TGGTGTTTGA AGCAGACCAG TACAAACTGC GGCCGGTAGC AGCGGAGATC TACGGGACAG 300

AAGAGAATCC CCAAACCTAC TATTATGCTG TGGCTGTAGT GAAGAAAGGT TTCAACTTTC 360

AGAACCAGCT ACAAGGTCGA AAGTCCTGCC ACACAGGCCT TGGCAGGTCT GCCGGGTGGA 420

ATATCCCTAT AGGGTTACTT CGCCGGTTCT TGGACTGGGC AGGGCCACCT GAGCCCCTCC 480

AGAAAGCTGT GGCCAAATTC TTCTCTCAGA GCTGTGTGCC CTGCGCAGAT GGAAATGCGT 540

ATCCCAACCT GTGTCAGCTG TGCATAGGGA AAGGGAAAGA TAAATGTGCT TGTTCCTCCC 600

AGGAACCGTA TTTTGGCTAT TCCGGTGCCT TCAACTGTCT GCACAAAGGG ATTGGAGATG 660

TGGCTTTTGT CAAGGAGAGT ACAGTGTTTG AGAACCTGCC ACAGAAGGCT GACCGGGACA 720

AATACGAGCT ACTCTGCCCA GACAATACTC GAAAGCCAGT GGAAGCATTC AGGGAGTGCC 780

ACCTTGCCCG GGTCCCTTCT CATGCTGTTG TGGCCCGAAG TGTGAATGGC AAGGAGAACT 840

CCATCTGGGA GCTTCTCTAC CAGTCACAGA AAAAGTTTGG AAAAAGCAAT CCACAGGAGT 900

TCCAGCTCTT TGGCTCTCCT GGTCAGCAGA AGGACCTCCT GTTTAGAGAT GCTACCATCG 960

GGTTTTTGAA GATCCCCTCA AAGATAGATT CTAAGCTGTA CCTGGGCCTC CCGTACCTTA 1020

CTGCCATCCA GGGCCTGAGG GAAACGGCAG CGGAGGTGGA GGCGCGGCAG GCGAAGGTCG 1080

TGTGGTGCGC CGTGGGTCCA GAGGAGCTGC GCAAGTGCCG GCAGTGGAGC AGCCAGAGCA 1140

GCCAGAACCT GAACTGCAGC CTGGCCTCCA CCACCGAGGA CTGCATCGTC CAGGTGCTGA 1200

AAGGAGAAGC TGATGCTATG AGCTTGGATG GAGGATTTAT CTACACTGCG GGCAAGTGTG 1260

GTTTGGTGCC TGTCCTGGCA GAGAACCAAA AATCTCGCCA AAGCAGTAGC TCAGACTGTG 1320

TGCATAGACC AACACAAGGG TATTTTGCCG TGGCGGTTGT CAGGAAAGCA AATGGTGGTA 1380

TCACCTGGAA CTCTGTGAGA GGCACGAAGT CCTGCCACAC TGCTGTGGAC AGGACAGCAG 1440

GCTGGAACAT CCCCATGGGC CTGCTTGTCA ACCAGACAGG CTCCTGCAAA TTTGACGAAT 1500

TCTTTAGTCA AAGCTGTGCT CCTGGGTCTC AGCCGGGATC CAATCTCTGT GCACTGTGTG 1560

TTGGCAATGA CCAGGGCGTG GACAAGTGTG TGCCCAACAG TAATGAGAGA TACTATGGTT 1620

ACACCGGGGC TTTCAGGTGC CTGGCTGAGA ATGCTGGGGA TGTGGCGTTT GTGAAAGATG 1680

TCACTGTCTT GGACAACACG AATGGACAGA ACACAGAAGA GTGGGCCAGG GAATTGAGGT 1740

CAGATGACTT TGAGCTGCTG TGCCTTGATG GCACCAGGAA GCCTGTGACT GAGGCTCAGA 1800

ACTGTCACCT GGCTGTGGCC CCCAGTCATG CTGTGGTCTC TCGGTUIGGAA AAGGCAGCAC 1860

AGGTGGAACA GGTGCTACTC ACTGAGCAGG CTCAGTTTGG AAGATACGGA AAAGACTGCC 1920

CGGACAAGTT TTGCTTGTTC CGGTCTGAGA CCAAAAACCT TCTGTTCAAC GACAACACGG 1980

AGGTTCTGGC CCT^CTCCAA GGCAAAACAA CATACGAAAA ATATTTGGGA TCAGAGTATG 2040

TCACAGCCAT CGCTAACCTG AAACAGTGCT CAGTCTCCCC GCTTCTGGAA GCCTGTGCCT 2100

TCATGATGAG GTAAAACCGG AATAGAAGCT GCCCGCCTCC CCAGGGGCCT CAGCTTTCCC 2160

TCCTCCCGTC TTGATTCCCA GCTGCCCTGG GCCTGCCTCT CTCCCTTCCT GAGGGCAGAC 2220

TTTGTTCAGC TCATCCGTTT TCACAATTCC CTCGTGCCG 2259

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 703 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Sus scrofa

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Lys Leu Phe Ile Pro Ala Leu Leu Phe Leu Gly Thr Leu Gly Leu
1 5 10 15

Cys Leu Ala Ala Pro Lys Lys Gly Val Arg Trp Cys Val Ile Ser Thr
20 25 30

Ala Glu Tyr Ser Lys Cys Arg Gln Trp Gln Ser Lys Ile Arg Arg Thr
35 40 45

Asn Pro Met Phe Cys Ile Arg Arg Ala Ser Pro Thr Asp Cys Ile Arg
50 55 60

Ala Ile Ala Ala Lys Arg Ala Asp Ala Val Thr Leu Asp Gly Gly Leu
65 70 75 80

Val Phe Glu Ala Asp Gln Tyr Lys Leu Arg Pro Val Ala Ala Glu Ile
85 90 95

Tyr Gly Thr Glu Glu Asn Pro Gln Thr Tyr Tyr Tyr Ala Val Ala Val
100 105 110

Val Lys Lys Gly Phe Asn Phe Gln Asn Gln Leu Gln Gly Arg Lys Ser
115 120 125

Cys His Thr Gly Leu Gly Arg Ser Ala Gly Trp Asn Ile Pro Ile Gly
130 135 140

Leu Leu Arg Arg Phe Leu Asp Trp Ala Gly Pro Pro Glu Pro Leu Gln
145 150 155 160

Lys Ala Val Ala Lys Phe Phe Ser Gln Ser Cys Val Pro Cys Ala Asp
165 170 175

Gly Asn Ala Tyr Pro Asn Leu Cys Gln Leu Cys Ile Gly Lys Gly Lys
180 185 190

Asp Lys Cys Ala Cys Ser Ser Gln Glu Pro Tyr Phe Gly Tyr Ser Gly
195 200 205

Ala Phe Asn Cys Leu His Lys Gly Ile Gly Asp Val Ala Phe Val Lys
210 215 220

Glu Ser Thr Val Phe Glu Asn Leu Pro Gln Lys Ala Asp Arg Asp Lys
225 230 235 240

Tyr Glu Leu Leu Cys Pro Asp Asn Thr Arg Lys Pro Val Glu Ala Phe
245 250 255

Arg Glu Cys His Leu Ala Arg Val Pro Ser His Ala Val Val Ala Arg
260 265 270

Ser Val Asn Gly Lys Glu Asn Ser Ile Trp Glu Leu Leu Tyr Gln Ser
275 280 285

Gln Lys Lys Phe Gly Lys Ser Asn Pro Gln Glu Phe Gln Leu Phe Gly
290 295 300

Ser Pro Gly Gln Gln Lys Asp Leu Leu Phe Arg Asp Ala Thr Ile Gly
305 310 315 320

Phe Leu Lys Ile Pro Ser Lys Ile Asp Ser Lys Leu Tyr Leu Gly Leu
325 330 335

Pro Tyr Leu Thr Ala Ile Gln Gly Leu Arg Glu Thr Ala Ala Glu Val
340 345 350

Glu Ala Arg Gln Ala Lys Val Val Trp Cys Ala Val Gly Pro Glu Glu
355 360 365

Leu Arg Lys Cys Arg Gln Trp Ser Ser Gln Ser Ser Gln Asn Leu Asn
370 375 380

Cys Ser Leu Ala Ser Thr Thr Glu Asp Cys Ile Val Gln Val Leu Lys
385 390 395 400

Gly Glu Ala Asp Ala Met Ser Leu Asp Gly Gly Phe Ile Tyr Thr Ala
405 410 415

Gly Lys Cys Gly Leu Val Pro Val Leu Ala Glu Asn Gln Lys Ser Arg
420 425 430

Gln Ser Ser Ser Ser Asp Cys Val His Arg Pro Thr Gln Gly Tyr Phe
435 440 445

Ala Val Ala Val Val Arg Lys Ala Asn Gly Gly Ile Thr Trp Asn Ser
450 455 460

Val Arg Gly Thr Lys Ser Cys His Thr Ala Val Asp Arg Thr Ala Gly
465 470 475 480

Trp Asn Ile Pro Met Gly Leu Leu Val Asn Gln Thr Gly Ser Cys Lys
485 490 495

Phe Asp Glu Phe Phe Ser Gln Ser Cys Ala Pro Gly Ser Gln Pro Gly
500 505 510

Ser Asn Leu Cys Ala Leu Cys Val Gly Asn Asp Gln Gly Val Asp Lys
515 520 525

Cys Val Pro Asn Ser Asn Glu Arg Tyr Tyr Gly Tyr Thr Gly Ala Phe
530 535 540

Arg Cys Leu Ala Glu Asn Ala Gly Asp Val Ala Phe Val Lys Asp Val
545 550 555 560

Thr Val Leu Asp Asn Thr Asn Gly Gln Asn Thr Glu Glu Trp Ala Arg
565 570 575